智能论文笔记

Two-Step Color-Polarization Demosaicking Network

Vy Nguyen , Masayuki Tanaka , Yusuke Monno , Masatoshi Okutomi

分类：计算机视觉

2022-09-13

场景中光的极化信息对于各种图像处理和计算机视觉任务很有价值。平面偏光仪是一种有前途的方法，可以一次性地捕获不同方向的极化图像，而它需要颜色极化的表现。在本文中，我们提出了一个两步的颜色偏振化学网络〜（TCPDNET），该网络由两个颜色的表演和极化演示组成。我们还引入了YCBCR颜色空间中的重建损失，以提高TCPDNET的性能。实验比较表明，TCPDNET在极化图像的图像质量和Stokes参数的准确性方面优于现有方法。

translated by 谷歌翻译

PoF: Post-Training of Feature Extractor for Improving Generalization

Ikuro Sato , Ryota Yamada , Masayuki Tanaka , Nakamasa Inoue , Rei Kawakami

分类：机器学习

2022-07-05

经过深入的研究，最低限度的损失景观的局部形状，尤其是平坦度对于深层模型的概括起重要作用。我们开发了一种称为POF的培训算法：特征提取器的训练后培训，该培训更新了已经训练的深层模型的特征提取器部分，以搜索最小的最小值。特征是两倍：1）特征提取器在高层参数空间中的参数扰动下受到训练，基于表明使更高层参数空间变平的观测值，以及2）扰动范围以数据驱动的方式确定旨在减少由正损失曲率引起的一部分测试损失。我们提供了理论分析，该分析表明所提出的算法隐含地减少了目标Hessian组件以及损失。实验结果表明，POF仅针对CIFAR-10和CIFAR-100数据集的基线方法提高了模型性能，仅用于10个上学后培训，以及用于50个上学后培训的SVHN数据集。源代码可用：\ url {https://github.com/densoitlab/pof-v1

translated by 谷歌翻译

Co-evolving morphology and control of soft robots using a single genome

Fabio Tanaka , Claus Aranha

分类：人工智能

2022-12-22

When simulating soft robots, both their morphology and their controllers play important roles in task performance. This paper introduces a new method to co-evolve these two components in the same process. We do that by using the hyperNEAT algorithm to generate two separate neural networks in one pass, one responsible for the design of the robot body structure and the other for the control of the robot. The key difference between our method and most existing approaches is that it does not treat the development of the morphology and the controller as separate processes. Similar to nature, our method derives both the "brain" and the "body" of an agent from a single genome and develops them together. While our approach is more realistic and doesn't require an arbitrary separation of processes during evolution, it also makes the problem more complex because the search space for this single genome becomes larger and any mutation to the genome affects "brain" and the "body" at the same time. Additionally, we present a new speciation function that takes into consideration both the genotypic distance, as is the standard for NEAT, and the similarity between robot bodies. By using this function, agents with very different bodies are more likely to be in different species, this allows robots with different morphologies to have more specialized controllers since they won't crossover with other robots that are too different from them. We evaluate the presented methods on four tasks and observe that even if the search space was larger, having a single genome makes the evolution process converge faster when compared to having separated genomes for body and control. The agents in our population also show morphologies with a high degree of regularity and controllers capable of coordinating the voxels to produce the necessary movements.

translated by 谷歌翻译

DiffG-RL: Leveraging Difference between State and Common Sense

Tsunehiko Tanaka , Daiki Kimura , Michiaki Tatsubori

分类：自然语言处理 | 人工智能

2022-11-29

Taking into account background knowledge as the context has always been an important part of solving tasks that involve natural language. One representative example of such tasks is text-based games, where players need to make decisions based on both description text previously shown in the game, and their own background knowledge about the language and common sense. In this work, we investigate not simply giving common sense, as can be seen in prior research, but also its effective usage. We assume that a part of the environment states different from common sense should constitute one of the grounds for action selection. We propose a novel agent, DiffG-RL, which constructs a Difference Graph that organizes the environment states and common sense by means of interactive objects with a dedicated graph encoder. DiffG-RL also contains a framework for extracting the appropriate amount and representation of common sense from the source to support the construction of the graph. We validate DiffG-RL in experiments with text-based games that require common sense and show that it outperforms baselines by 17% of scores. The code is available at https://github.com/ibm/diffg-rl

translated by 谷歌翻译

Hibikino-Musashi@Home 2018 Team Description Paper

Yutaro Ishida , Sansei Hori , Yuichiro Tanaka , Yuma Yoshimoto , Kouhei Hashimoto , Gouki Iwamoto , Yoshiya Aratani , Kenya Yamashita , Shinya Ishimoto , Kyosuke Hitaka

分类：机器人

2022-11-09

Our team, Hibikino-Musashi@Home (the shortened name is HMA), was founded in 2010. It is based in the Kitakyushu Science and Research Park, Japan. We have participated in the RoboCup@Home Japan open competition open platform league every year since 2010. Moreover, we participated in the RoboCup 2017 Nagoya as open platform league and domestic standard platform league teams. Currently, the Hibikino-Musashi@Home team has 20 members from seven different laboratories based in the Kyushu Institute of Technology. In this paper, we introduce the activities of our team and the technologies.

translated by 谷歌翻译

Fashion-Specific Attributes Interpretation via Dual Gaussian Visual-Semantic Embedding

Ryotaro Shimizu , Masanari Kimura , Masayuki Goto

分类：计算机视觉 | 机器学习

2022-10-28

Several techniques to map various types of components, such as words, attributes, and images, into the embedded space have been studied. Most of them estimate the embedded representation of target entity as a point in the projective space. Some models, such as Word2Gauss, assume a probability distribution behind the embedded representation, which enables the spread or variance of the meaning of embedded target components to be captured and considered in more detail. We examine the method of estimating embedded representations as probability distributions for the interpretation of fashion-specific abstract and difficult-to-understand terms. Terms, such as "casual," "adult-casual,'' "beauty-casual," and "formal," are extremely subjective and abstract and are difficult for both experts and non-experts to understand, which discourages users from trying new fashion. We propose an end-to-end model called dual Gaussian visual-semantic embedding, which maps images and attributes in the same projective space and enables the interpretation of the meaning of these terms by its broad applications. We demonstrate the effectiveness of the proposed method through multifaceted experiments involving image and attribute mapping, image retrieval and re-ordering techniques, and a detailed theoretical/analytical discussion of the distance measure included in the loss function.

translated by 谷歌翻译

Cem Mil Podcasts: A Spoken Portuguese Document Corpus

Edgar Tanaka , Ann Clifton , Joana Correia , Sharmistha Jat , Rosie Jones , Jussi Karlgren , Winstead Zhu

分类：自然语言处理

2022-09-23

本文档描述了Spotify出于学术研究目的发布的葡萄牙语播客数据集。我们概述了如何采样数据，有关集合的一些基本统计数据，以及有关巴西和葡萄牙方言的分发信息的简要信息。

translated by 谷歌翻译

On the Adversarial Transferability of ConvMixer Models

Ryota Iijima , Miki Tanaka , Isao Echizen , Hitoshi Kiya

分类：机器学习

2022-09-19

深度神经网络（DNN）众所周知，很容易受到对抗例子的影响（AES）。此外，AE具有对抗性可传递性，这意味着为源模型生成的AE可以以非平凡的概率欺骗另一个黑框模型（目标模型）。在本文中，我们首次研究了包括Convmixer在内的模型之间的对抗性转移性的属性。为了客观地验证可转让性的属性，使用称为AutoAttack的基准攻击方法评估模型的鲁棒性。在图像分类实验中，Convmixer被确认对对抗性转移性较弱。

translated by 谷歌翻译

Real-to-Sim: Deep Learning with Auto-Tuning to Predict Residual Errors using Sparse Data

Alexander Schperberg , Yusuke Tanaka , Feng Xu , Marcel Menner , Dennis Hong

分类：机器人 | 机器学习

2022-09-07

实现接近真实机器人的高度准确的运动学或模拟器模型可以促进基于模型的控制（例如，模型预测性控制或线性质量调节器），基于模型的轨迹计划（例如，轨迹优化），并减少增强学习方法所需的学习时间。因此，这项工作的目的是学习运动学和/或模拟器模型与真实机器人之间的残余误差。这是使用自动调节和神经网络实现的，其中使用自动调整方法更新神经网络的参数，该方法应用了从无味的Kalman滤波器（UKF）公式进行方程式。使用此方法，我们仅使用少量数据对这些残差错误进行建模 - 当我们直接从硬件操作中学习改善模拟器/运动学模型时，这是必要的。我们演示了关于机器人硬件（例如操纵器组）的方法，并表明，通过学习的残差错误，我们可以进一步缩小运动学模型，模拟和真实机器人之间的现实差距。

translated by 谷歌翻译

On the Transferability of Adversarial Examples between Encrypted Models

Miki Tanaka , Isao Echizen , Hitoshi Kiya

分类：计算机视觉

2022-09-07

深度神经网络（DNN）众所周知，很容易受到对抗例子的影响（AES）。此外，AE具有对抗性转移性，即为源模型傻瓜（目标）模型生成的AE。在本文中，我们首次研究了为对抗性强大防御的模型的可传递性。为了客观地验证可转让性的属性，使用称为AutoAttack的基准攻击方法评估模型的鲁棒性。在图像分类实验中，使用加密模型的使用不仅是对AE的鲁棒性，而且还可以减少AES在模型的可传递性方面的影响。

translated by 谷歌翻译